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Presented here is a draft genome sequence for Staphylococcus agnetis CBMRN 20813338, isolated from a lactating dairy cow with 
subclinical mastitis. The genome is approximately 2,416 kb and has 35.79% G+C content. Analysis of the deduced open reading 
frame (ORF) set identified candidate virulence attributes in addition to potential molecular targets for species identification. 



Received 5 August 2014 Accepted 8 August 2014 Published 4 September 2014 

Citation Calcutt MJ, Foecking MF, Fry PR, Hsieh H-Y, Perry J, Stewart GC, Scholl DT, Messier S, Middleton JR. 201 4. Draft genome sequence of bovine mastitis isolate 
Staphylococcus agnetis CBMRN 20813338. Genome Announc. 2(5):e00883-14. doi:1 0.1 l28/genomeA.00883-14. 

Copyright © 2014 Calcutt et a I. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. 
Address correspondence to John R. Middleton, middletonjr@missouri.edu. 



Staphylococci are common etiological agents of clinical and 
subclinical bovine mastitis (1). The number of staphylococcal 
taxa discovered to be associated with intramammary infection 
recently increased following the isolation of a new species, Staph- 
ylococcus agnetis, from mastitic milk in Finland (2). Genotypic 
analysis of three housekeeping genes indicated that this species is 
closely related to Staphylococcus hyicus. S. hyicus has been fre- 
quently isolated in mastitis studies (3), although the identification 
of S. agnetis now raises the possibility that some S. hyicus isolates 
may, in hindsight, have been S. agnetis. As only three genes have 
been sequenced for S. agnetis, a draft genome sequence was deter- 
mined for a North American mastitic isolate to identify coloniza- 
tion factors and provide additional loci for molecular discrimina- 
tion from S. hyicus. Strain CBMRN 20813338, isolated from a 
dairy cow in the Canadian Bovine Mastitis Research Network co- 
hort of herds, was selected, as it caused a high milk somatic cell 
count (3,215,000 cells/ml [normal, <200,000 cells/ml]) in the af- 
fected mammary quarter. 

Total DNA from S. agnetis CBMRN 20813338 was subjected to 
454 sequencing and de novo assembly (Newbler; Roche) at The 
Genome Institute at Washington University, St. Louis, MO. The 
resulting 2,415,997-bp genome was encompassed within 45 con- 
tigs, with 46 X coverage. Autoannotation was performed using 
PGAP at NCBI, yielding 2,336 open reading frames (ORFs), 63 
tRNAs, and a predicted 6 rRNA-coding operons. 

Among the staphylococci for which genomic data are available, 
the assembled contigs of CBMRN 20813338 are most similar in 
gene content and organization to those of Staphylococcus chromo- 
genes (4), consistent with the 16S rRNA-based phylogeny of these 
two taxa. Among the regions of variation between S. chromogenes 
and S. agnetis is an approximately 7-kb locus in S. agnetis harbor- 
ing genes for putative gas vesicle proteins (5). Although gas vesicle 
structures have not been reported for any Staphylococcaceae, a 
related gene set was recently identified in two Staphylococcus simu- 



lans genomes (6). PCR analysis revealed the presence of this locus 
in porcine S. hyicus isolates as well (data not shown). 

Noteworthy among the predicted exoproteome of the poten- 
tial virulence factors are two tandemly arranged ORFs with signif- 
icant similarity to staphylocoagulase (7), an ORF with a significant 
match to beta toxin (sphingomyelin phosphodiesterase) of Staph- 
ylococcus aureus (8), three clustered genes encoding putative su- 
perantigens, and a gene encoding a putative hyaluronidase. The 
hyaluronidase gene was unanticipated, as, among the staphylo- 
cocci, hyaluronidase genes previously were identified in S. aureus 
only (9). No ORFs with similarity to the trypsin-like proteases of 
the Exh exfoliative toxin family were identified; such toxins are 
frequently encrypted in S. hyicus strains that cause swine exudative 
epidermitis (10). 

Analysis of the mobile genetic elements revealed two putative 
prophage genomes and 10 distinct insertion sequences that en- 
code transposases belonging to either the IS200 (89% identity) or 
IS1272 (56% identity) families. No clustered regularly interspaced 
short palindromic repeat (CRISPR)/Cas loci were detected. 

This data set is the first draft genome sequence for the species 
and provides insight into the features that may contribute to suc- 
cessful host colonization and patterns of gene variation among the 
Staphylococcaceae. 

Nucleotide sequence accession number. This whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession no. JPRT0 1000000. 
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